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In  this  work  we  study  the  problem  of  minimum  energy  docking  maneuvers  between  two 
Floating  Spacecraft  Simulators.  The  maneuvers  are  planar  and  conducted  autonomously  in 
a  cooperative  mode.  The  proposed  guidance  strategy  is  based  on  the  direct  method  known 
as  Inverse  Dynamics  in  the  Virtual  Domain,  and  the  nonlinear  programming  solver  known 
as  Sequential  Gradient-Restoration  Algorithm.  The  combination  of  these  methods  allows 
for  the  quick  prototyping  of  near-optimal  trajectories,  and  results  in  an  implementable 
tool  for  real-time  closed-loop  maneuvering.  The  experimental  results  included  in  this 
paper  were  obtained  by  exploiting  the  recently  upgraded  Floating  Spacecraft-Simulator 
Testbed  of  the  Spacecraft  Robotics  Laboratory  at  the  Naval  Postgraduate  School.  A  direct 
performances  comparison,  in  terms  of  maneuver  energy  and  propellant  mass,  between 
the  proposed  guidance  strategy  and  a  LQR  controller,  demonstrates  the  effectiveness  of 
the  method. 

©  2014  IAA.  Published  by  Elsevier  Ltd.  All  rights  reserved. 


1.  Introduction 

In  a  previous  paper  [1],  we  have  introduced  a  method 
to  rapid  prototype  near-optimal  planar  proximity  maneu¬ 
vers.  It  was  based  on  the  coupling  of  a  direct  method,  the 
Inverse  Dynamics  in  the  Virtual  Domain  (IDVD),  and  a 
nonlinear  programming  solver,  the  Sequential  Gradient- 
Restoration  Algorithm  (SGRA).  The  maneuver  scenarios 
required  a  chaser  spacecraft  to  execute  a  planar  docking 
with  a  tumbling  non-cooperative  target;  both  minimum 
time  and  minimum  energy  maneuvers  were  considered. 
The  simulation  results  presented  showed  that  the  IDVD- 
SGRA  is  capable  of  computing  solution  trajectories  with 
CPU  time  of  the  order  of  a  second  (using  a  Thinkpad 
Lenovo  T500)  with  performance  index  values  very  close  to 
the  correspondent  values  obtained  from  the  optimal 
solution. 
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On  a  subsequent  work  [2]  the  same  method  has 
been  implemented  as  guidance  strategy  for  spacecraft 
translation-only  proximity  maneuvers  in  Clohessy-Wilt- 
shire  dynamics  with  path  constraints  capability.  Initially 
the  IDVD-SGRA  was  exploited  to  rapid  prototype  mini¬ 
mum  energy  trajectories  which  transfer  a  spacecraft  from 
a  given  initial  state,  described  in  terms  of  position  and 
velocity,  to  another  final  state  while  avoiding  a  keep-out- 
zone  located  along  the  path.  On  a  second  phase  the  IDVD- 
SGRA  has  been  implemented  on  a  closed-loop  fashion; 
namely,  along  the  trajectory,  the  spacecraft  position  com¬ 
ponents  and  velocity  components  are  updated  at  a  certain 
sample  frequency  and  the  IDVD-SGRA  is  reiterated  to 
compute  refreshed  values  of  the  thrust  components.  This 
closed-loop  implementation  allows  compensating  unpre¬ 
dicted  system  disturbances  such  as:  external  forces  acting 
on  the  spacecraft,  sensor  noise,  and  uncertainties  on  the 
system  (mass  change,  thrust  variation,  etc.). 

Other  successful  applications  of  the  IDVD  involve  the 
generation  of  6  DOF  minimum  time  and  minimum  energy 
rendezvous  trajectories  with  a  tumbling  object  [3],  and  3 
DOF  translation-only  docking  trajectory  for  an  orbiting 
spacecraft  [4]. 
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In  this  work,  we  exploit  experimentally  the  IDVD-SGRA 
as  near-optimal  guidance  strategy  for  the  autonomous 
cooperative  planar  docking  maneuvers  between  two  Float¬ 
ing  Spacecraft  Simulators  (FSS).  The  Floating  Spacecraft- 
Simulator  Testbed  [5,6]  is  an  on-ground  experimental 
facility  composed  of  a  set  of  four  autonomous  FSSs  capable 
of  flotation  (using  airpads)  over  a  high  precision  flat 
surface  (4  m  x  4  m);  a  set  of  eight  compressed  air  fed 
supersonic  thrusters  provides  the  thrust  and  torque 
required  to  maneuver  over  the  flat  floor. 

In  order  to  perform  cooperative  minimum  energy 
planar  docking,  the  IDVD-SGRA  has  been  implemented 
in  closed  loop  fashion  on  the  two  FSSs.  Each  FSS  will 
behave  as  a  chaser  which  must  dock  with  the  other  FSS 
(the  target).  While  maneuvering,  at  every  sample  time 
interval,  the  current  position  and  velocity  of  chaser  and 
target  are  updated  and  the  near-optimal  trajectory  is 
recomputed  to  attain  refreshed  values  of  the  controls  to 
be  commanded  on  the  subsequent  sample  interval.  All  the 
near-optimal  trajectories  computed  using  IDVD  are  intrin¬ 
sically  characterized  by  continuous  controls,  at  the  same 
time  the  SGRA  enforces  bounds  on  the  commanded  thrust, 
commanded  torque,  and  maximum  maneuver  time. 

On  a  second  phase  we  have  considered  a  different 
cooperative  docking  maneuver;  namely,  one  FSS  will  act 
as  a  chaser  trying  to  reduce  the  separation  distance  from 
the  target  which  instead  will  stay  at  rest  in  its  initial 
position  while  aiming  its  docking  interface  toward  the 
target. 

For  comparison  purposes,  the  same  set  of  docking 
maneuvers  has  also  been  executed  using  an  LQR  controller 
[7-11].  The  results  show  that  the  docking  trajectories 
performed  with  the  IDVD-SGRA  guidance  are  character¬ 
ized  by  a  lower  amount  of  total  maneuver  energy  and  fuel 
consumption  with  respect  to  the  LQR. 

2.  Inverse  dynamics  in  the  virtual  domain 

The  planar  floating  motion  of  the  FSS  is  described  by 
the  following  equations  (Fig.  1) 

Fx  =  mx, 

Fy  =  my, 

Fz  =  lz6,  (1) 


where  m  and  Iz  are  the  FSS's  mass  and  moment  of  inertia, 
Fx  and  Fy  are  the  thrust  components,  and  Tz  is  the  torque 
along  the  z-axis  which  is  orthogonal  to  the  xy-plane  and 
directed  in  accordance  to  the  right  hand  rule. 

Let  x0,y0,6 o,  x0,y0,90  and Xf,yf,9f,  Xf,yf,0f  be  the  FSS 
initial  and  final  states,  respectively.  The  reference  optimal 
trajectory  we  are  interested  in  is  the  solution  of  the 
following  Bolza-Pontryagin  optimal  control  problem: 
minimize  the  maneuver  energy 

E=\j\Fl  +  F]  +  Tl)dt  (2) 

with  respect  to  the  controls  FXf  Fy ,  Tz  in  order  to  verify  the 
differential  constraints  in  Eq.  (1),  the  inequality  constraints 

\Fx(t)  cos  (<9(t))  +  Fy(t)  sin  (0(t))|  <  Fmax,  (3) 

I  -Fx(t)  sin  (<9(t))  +  Fy(t)  cos  (<9(t))|  <  Fmax,  (4) 

|Fz(t)|  <  Tz  max,  (5) 

tf  <  tmax?  (6) 

and  the  boundary  conditions 

x(0)  =  x0,y(0)=y0, 0(0)  =  0o,x(t/)  =  xf,y(tf)=yf,  9(tf)  =  6f, 

(7) 

*(0)  =  *0,y(0)=y0, 0(0)  =  00,%)  =  xf,y(tf)=yf,  b(tf)  =  6f. 

(8) 

Eqs.  (3)  and  (4)  limit  the  thrust  components  in  body 
axis  to  the  maximum  thrust  available  Fmax,  Eq.  (5)  limits 
the  torque  commanded  to  the  maximum  torque  available 
Fz  max*  and  Eq.  (6)  limits  the  maneuver  time  to  a  pre¬ 
defined  maximum  value  tmax. 

The  goal  of  the  IDVD  is  to  convert  such  optimal  control 
problem  into  an  equivalent  nonlinear  programming  pro¬ 
blem;  namely,  the  problem  of  minimizing  the  performance 
index 

g  =  gOT,r2,...,rk)  (9) 

with  respect  to  a  set  of  varied  parameters  [ri,r2,...,rk], 
subject  to  the  constraints 

h(ri,r2,...,rk)  =  0.  (10) 

The  first  step  is  to  describe  the  FSS  trajectory  compo¬ 
nents  and  time  with  a  set  of  basis  functions  defined 
in  a  virtual  variable  domain  [12,13].  For  this  particular 
application,  these  components  (position  and  attitude)  are 
expressed  as  a  set  of  polynomials 

X(T)=  X  aiTl,y(T)=  X  biTl,0(T)=  X  CiTl,TE[0,Tf\,  (11) 
i  =  0  i  =  0  i  =  0 

where  r  is  an  independent  virtual  argument.  The  coeffi¬ 
cients  a„  bi ,  q  are  related  to  the  initial  and  final  conditions 
on  FSS  position,  velocity,  acceleration,  etc.  In  particular, 
they  can  be  chosen  in  such  a  way  that  these  boundary 
conditions  are  automatically  satisfied.  For  this  purpose,  the 
polynomials  degree  s  is  strictly  related  to  the  sum  of  initial 
and  final  conditions  that  need  to  be  imposed;  namely,  the 
following  inequality  must  hold  [2] 

s  >  Oq  +  Of  + 1 , 


Fig.  1.  Floating  Spacecraft  Simulator  reference  system. 


(12) 
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where  00  and  0/  are  the  time  derivative  higher  orders  of 
the  assigned  initial  and  final  conditions,  respectively.  For 
cooperative  docking  maneuvers  initial  and  final  conditions 
on  both  position  and  velocity  must  be  imposed,  that  is, 
s>  3.  The  final  choice  of  the  polynomial  degree  is  driven 
by  a  compromise  between  the  degrees  of  freedom  we 
want  to  introduce  on  the  trajectory  (in  terms  of  shape 
complexity)  and  computational  burden.  This  guidance 
application  requires  maximum  computational  speed  per¬ 
formances,  for  this  reason  we  have  chosen  the  polynomial 
of  the  lowest  possible  order  by  assuming 

x(r)  =  a0  +  a1T+a2T2  +  a3r3 
y(r)  =  b0  +  bi  r  +  b2r2  +  b3r3 

0(r)  =  Co  +  CiT  +  C2T2  +  C3T3,T  e  [0,t/].  (13) 


Solving  for  a0,  cq,  a2,  a3  we  obtain 


Oq  =  Xq,  cii  =  Xocfi ,  a2  = 


-3x0  -2x0T/di  +3Xf—xfTf(di  +2d2Tf) 


2x0+x0T/d1  -  2Xf  +xfTf(d\  +  2d2rf) 
Tf 


(21) 


Similarly,  by  imposing  the  boundary  conditions  on  the 
y-axis  y0,yf,y0,yf  and  rotational  axis  0o,Of,0o,bf,  we 
obtain  the  following  expressions  for  the  coefficients  b0 , 
hi,  b2,  b3,  Co,  Ci,  c2,  c3 


b0=y0,b1  =y0di,b2  = 


-3y0-2y0T/di  +3y;  -^(di  +2d2T/) 


,  2y0  +yoT/^i  —  2y/  +y/T/(d1  +  2d2Tf) 

b3  = - ? - 

Tf 


(22) 


As  regards  to  the  mapping  between  the  time  domain 
and  the  virtual  argument  domain,  it  is  also  expressed 
through  the  polynomial  expression 

t=  Z  dir1, re[0,rf].  (14) 

2  =  1 

In  this  case,  the  polynomial  order  p  affects  the  flex¬ 
ibility  on  the  speed  changes  along  the  trajectory  and 
computational  burden  as  well.  Previous  investigations 
[1,2]  have  shown  that  a  good  compromise  for  planar 
maneuvers  is  p  =  2 

t  =  d\T+d2T2,T  e  (15) 


36>o  -2d0Tfdi  +30f  —  0fTf(d\  +2  d2tf) 


Co  =  0o,  Ci  =  dodi ,  c2  = 

7 

20q  +  0oT/d\  —  2  Of  +  0fTf(d\  +  2d2Tf) 


63  = 


73 


(23) 


By  substituting  these  expressions  in  Eq.  (13)  we  achieve 
the  following  analytical  descriptions  of  the  position  com¬ 
ponents  and  attitude  in  the  virtual  domain 

x  =  x(x0,Xf,  x0 ,  Xf  ,d\,  d2 ,  Tf ,  t), 

y=y(y0,y/,yo,y/’  dud2,  *f,  0, 

0  =  0(Oo,Of  ,Oo->df ->d\,d2,Tf  ,t),  tg[  0,Tf].  (24) 


Note  that  an  analytical  description  of  the  trajectory  is 
attainable  once  the  coefficients  a„  bit  ch  d/  and  the  final 
value  of  the  virtual  variable  r/  are  known. 

2.1.  Boundary  conditions  and  dynamics  inversion 

As  mentioned  before,  the  coefficients  ait  biy  cz  are  related 
to  the  initial  and  final  conditions  on  position  and  velocity. 
From  Eq.  (7)  we  have 

x(0)  =  a0=  x0,x(rf)  =  a0  +  a,  rf  +  a2rj +a3rj=Xf.  (1 6) 

The  boundary  conditions  on  velocity  components  and 
angular  rate,  Eq.  (8),  cannot  be  directly  imposed  without 
some  introductory  considerations.  Let  the  symbol  prime 
“/M  denote  the  operator  derivative  with  respect  to  the 
virtual  variable  t.  It  results: 


Exploiting  the  derivative  rule  in  Eq.  (17),  we  obtain 
a  similar  expression  for  the  velocity  components  and 
angular  rate 

X  =  X(X0,Xf,  Xo,Xf,d-l,d2,Tf,r), 

y=y(y0^s^O’y^  di  -  d2,  rf,  o, 

6  =  6(Po,6f,6o,Of,d\,d2,Tf,T),  t  e  [0 ,Tf].  (25) 


As  regard  to  the  acceleration  component  x,  from 
Eq.  (17)  it  results 


„  _  dx  _  d 


dt  dt  \dt/d 


x'  \  _  dr  d  /  x'  \ 
It /dr)  ~  dt  dr  ydt/dzj 


Similarly,  for  y  and  0  we  get 
1 


«  dy 

y=^  = 


dt  (dt/dr)2 


.dh 

1 dt2 


1  „  ■  rt 

(dt/dr)2  y  Xdr2J- 

(26) 

(27) 


,  dx  dx  dt  .  dt 

(17) 

dr  dt  dr  dr  ’ 

where 

x!  —  a\  +2a2r+3a3r2, 

(18) 

dt  _ 

—  =d\  +2  d2r. 
dr 

(19) 

The  boundary  conditions  on  velocity  components  then 
become 


x'(0)  =  a\  =  x0di ,  x'(Tf)  =  a\  +2a2r/+3a3r2  =xf(d-i  +2  d2tf). 

(20) 


1 

(dt/dr)2 


(28) 


This  procedure  allows  us  to  attain  analytical  expres¬ 
sions  in  the  virtual  domain  also  for  the  acceleration 
components  and  angular  acceleration 

X  =  X(X0,Xf,  X0,Xf,  di ,  d2,  Tf,  r), 
y =y(yo,yf,yo,yf,di,d2,rf,r), 

0  =  0(00, 0fi  00,  Of,  di ,  d2,  Tf,  r),  T  e  [0,  rf].  (29) 


The  main  result  of  this  process  is  that,  through  the 
inversion  of  the  FSS  dynamics 

Fx  =  mx, 
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Fy  =  my, 

Tz  =  lz0,  (30) 

We  have  obtained  the  analytical  expressions  in  the  virtual 
domain  of  the  controls  acting  on  the  FSS 

Fx  =  Fx(m,  x0,xf,x0,xf,  dud2,  rf,  t), 

Fy  =  Fy(m,y0,yf,y0,yf,  dud2,  rf,  r), 

Tz  =  Tz(lz,6o,6f,do,6f,  d\,d2,Tf,T),  re[0,rf].  (31) 

These  expressions  involve  a  set  of  given  parameters 
m,Iz,Xo,yo,Oo,Xo,yo,0o,  Xf,yf,0f,kf,yf,0f  a  set  of  varied 
parameters  d\,d2,Tf  and  the  virtual  argument  t. 

2.2.  Nonlinear  programming  problem 

As  mentioned  before,  for  the  cooperative  planar  dock¬ 
ing,  the  optimization  criterion  chosen  is  the  maneuver 
energy 

E=lf\F2x+F2y  +  T2z)dt.  (32) 

In  virtue  of  Eq.  (17)  and  exploiting  Eq.  (31)  we  can 
convert  this  integral  in  the  time  domain  into  an  integral  in 
the  virtual  domain 

1  rrf  n  t  0  dt  l  rTr  n  t  t 
E=2Jo  iFx  +  Fy  +  Tz)Ttdr=2j0  (Fx+Fy  +  T2z)(di+2d2r)dt, 

(33) 

which  implies 

E  =  E(m,  Iz, x0,y0, 00,x0,y0, 0o,xf,yf,  0f, xf,yf,  0f,  dx ,  d2,  rf).  (34) 

As  regard  to  the  bounds  on  the  controls  in  Eqs.  (3)-(5), 
the  eight  microthrusters  on  board  of  the  FSS  can  produce 
thrust  components  and  torque  subject  to  the  following 
constraints 

\F*B(t)\  ^  Fmax, 

\FyM  ^  Fmax, 

\Tz(t)\  <  Tz  max,  t  e  [0,  tf]  (35) 

where  Fx„(t)  and  FyB(t)  are  the  thrust  components  pro¬ 
jected  in  body  axis 

FXB(t)  =  Fx(t)  cos  (0(t))+Fy(t)  sin  (0(t)), 

FyB (t)  =  —  Fx(t)  sin  (d(t)) + Fy(t)  cos  (d(t)).  (36) 

Let  us  discretize  the  virtual  domain  [0,t/]  in  m  +  1 
equidistant  nodes  0  =  t0  <  <  •••  <  rm  =  t/  such  that 

rj=^Tf,  j e  {0, 1,2, m},  (37) 

to  which  correspond,  in  the  time  domain,  the  nodes 
0  =  t0<ti  <-<tm  =  tf 

tj  =  d\Tj  +  d2rj ,  j  g  {0, 1,2,  ...,m}.  (38) 

An  indirect  enforcement  of  the  inequalities  in  Eq.  (35)  is 
obtained  introducing  the  slack  variables  eFx,  eFy ,  eTz  and 
imposing  the  equalities 

max((Fx(rj)  cos  (0(tj  ))+Fy(rj)  sin  (<%  )))2) + ejx  =  F2max, 
max((  -  Fx( tj)  sin  (6»(rj)) +Fy(i })  cos  (0(tj)))2)+ ejy  =  F^ax, 
max((Tz(Tj))2)+e|z  =  Tj max,  je  {0, 1,2,  (39) 


It  is  very  important  to  note  that  the  maxima  in  Eq.  (39) 
happen  to  be  enforced  in  correspondence  of  the  virtual 
time  nodes  that  are  fractions  of  xf  (see  Eq.  (37)).  In  this 
way,  it  is  possible  to  relate  those  equality  conditions  to  this 
varied  parameter.  Needless  to  mention,  this  method  does 
not  provide  a  rigorous  enforcement  of  the  constraints,  in 
fact  the  inequalities  in  Eq.  (29)  will  be  guaranteed  only  in 
proximity  of  the  actual  maxima.  Obviously,  higher  number 
of  nodes  will  imply  more  accurate  enforcement  of  the 
bounds  but  with  the  disadvantage  of  a  higher  number  of 
numerical  evaluations.  For  practical  applications,  values  of 
m  ranging  between  20  and  100  provide,  usually,  satisfac¬ 
tory  results. 

The  inequality  constraint  on  the  maneuver  time 

tf  <  tmax?  (40) 

can  be  enforced  by  introducing  the  slack  variable  et  and 
the  equality 

(d-iTf  +  d2Tf)2  +  e^  =  t^ax.  (41) 

We  can  finally  describe  our  nonlinear  programming 

problem  in  aforementioned  form,  that  is:  minimize  the 
performance  index 

g  =  g(rur2,...,rk)  (42) 

with  respect  to  a  set  of  varied  parameters  [ri,r2,...,rfe], 
subject  to  the  constraints 

h(r1,r2,...,rk)  =  0.  (43) 

In  this  case,  the  problem  becomes:  given  the  set  of 
given  parameters  m,Iz,xo,yo,0 0,  xo,yo,0o,xf,yf,0f,xf,yf,0f 
minimize  the  performance  index 

E  =  E(m,Iz,Xo,yo,0o,xo,yo,0o,xf,yf,0f,xf,yf,0f,dud2,Tf) 

(44) 

with  respect  to  the  varied  parameters  d\,d2,Tf,eFx,eFy, 
eTz,  et  subject  to  the  constraints 

max((Fx(Tj)  cos  (<%)) + Fy  (t j)  sin  (d(r y)))2) + ejx  =  F2max , 
max((  -  Fx(z j)  sin  Wr,)) + Fy(rj)  cos  (6»(r,)))2) + ejy  =  F^ax, 
max((Tz(Tj))2)+e2z  =  T2 max,  j e {0, 1, 2 
(d\Tf+d2rj)2+e2  =  t^ax-  (45) 


3.  Sequential  gradient-restoration  algorithm 


The  algorithm  chosen  to  solve  the  previous  optimiza¬ 
tion  problem  is  the  nonlinear  programming  version  of  the 
SGRA  [14].  SGRA  is  based  on  two  scalar  quantities;  the 
constraint  error  P  =  hTh  and  the  optimality  condition  error 
Q  =  GrTGr  (see  Eqs.  (9)  and  (10)),  where 

G  =  g+r|Th  (46) 


is  the  augmented  function  associated  to  the  performance 
index  g  and  the  constraints  function  h,  and 


Cr  = 


dG_  dG_ 


(47) 


is  the  gradient  of  the  augmented  function  with  respect  to 
vector  of  varied  parameters  [ri,r2,...,rk].  In  Eq.  (46),  rj  is  the 
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Lagrange  multiplier  vector  associated  with  the  constraints 
function  h. 

SGRA  involves  a  cyclical  scheme  whereby  first  the 
constraints  are  satisfied  to  a  prescribed  accuracy  (restora¬ 
tion  phase);  then,  using  a  first-order  gradient  method,  a 
step  is  taken  toward  the  optimal  direction  to  improve  the 
performance  index  (gradient  phase).  Note  that,  in  this 
implementation,  Eqs.  (35)  and  (40)  are  satisfied  at  the 
end  of  each  restoration  phase.  The  convergence  tolerances 
for  each  phase  are  expressed  as  follows: 

P<e  i  (48) 

for  the  restoration  phase  and 

(l  <e2  (49) 

for  the  gradient  phase,  with  e t  and  e2  small  preselected 
positive  constants. 

The  qualifier  “sequential”  is  most  appropriate  for  this 
algorithm  due  to  the  successive  application  of  either  a 
gradient  iteration  or  a  restorative  iteration  depending  on 
the  values  of  the  scalar  quantities  P  and  Q.  Specifically, 
three  cases  can  occur: 

(Cl)  If  P>e i,  SGRA  executes  a  restorative  iteration 
leading  to  the  decrease  of  the  constraint  error. 

(C2)  If  P<£ i  but  Q>£2 ,  SGRA  executes  a  gradient 
iteration  leading  to  the  decrease  of  the  augmented 
function. 

(C3)  If  P<  £]  and  Q.<  e2»  convergence  is  declared  and 
the  algorithm  stops. 

In  general,  achieving  specified  final  conditions  in 
a  feasible  way  is  more  important  than  obtaining  an 
exact  optimal  solution:  so,  the  constraint  requirements 
(restoration  phase)  are  stricter  than  the  optimality  condi¬ 
tion  requirements  (gradient  phase),  that  is  £\«£2 • 

4.  Cooperative  docking  trajectories 

The  IDVD-SGRA  method  computes  near-optimal  tra¬ 
jectories  which  transfer  the  FSS  from  some  given  initial 
position  and  velocity  to  some  given  final  position  and 
velocity. 

Let's  indicate  with  FSS  A  and  FSS  B  the  two  vehicles  and 
with  xAo,yAo  and  xBo,yBo  their  center  of  mass  initial  posi¬ 


tions,  respectively.  At  the  beginning  of  the  maneuver  both 
are  at  rest  with  zero  initial  attitude  and  the  docking 
interfaces  oriented  toward  the  negative  direction  of  the 
x-axis  as  shown  in  Fig.  2.  The  initial  conditions,  for  the  two 
FSSs,  are  then  the  following: 

FSS  A 

xA(0)  =  xAo,yA(0)  =yAo,0A(O)  =  0, 

xA(0)  =  0,yA(0)  =  0,  0A(O)  =  0.  (50) 

FSS  B 

xB(0)  =  xBo  ,yB(0)  =  yBo ,  0A(O)  =  0, 

xB(0)  =  0,yB(0)  =  O,0B(O)  =  0.  (51) 

As  mentioned  before,  two  different  type  docking  execu¬ 
tions  have  been  investigated,  this  implies  that  two  differ¬ 
ent  set  of  final  conditions  must  be  imposed: 

(D1 )  During  the  maneuver  both  FSSs  behave  as  chasers. 
In  this  case  the  common  goal  is  to  reduce  the  separa¬ 
tion  distance  from  the  other  FSS  and  to  complete  the 
maneuver  with  the  matching  final  attitude.  In  order  to 
achieve  the  docking  configuration  it  is  necessary  that 
the  following  final  conditions  are  satisfied: 

FSS  A 

xA(tf)  =  xBf-d  cos  (0Bf  +  jt), 
yA(tf)  =  yBf-d  sin(0B/+/r), 

Pa  (£/)  =  0Bf  +  n, 

*A(  tf)  =  0  ,yA(tf)  =  0, 0A(tf)  =  0.  (52) 

FSS  B 

xB(t/)  =  xA( tf)  -  d  cos  (PB(tf)), 
yB(f/)  =yA(f/)  -  d  sin  (PB(tf)), 

0B  (tf)  =  0A(tf)  +  7T, 

xB(t/)  =  0,yB(tf)  =  0, 0B(tf)  =  0,  (53) 

where  d  is  the  distance  between  the  two  FSS's  center  of 
mass  in  docked  configuration  (Fig.  2).  The  need  to  add  n 
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Fig.  2.  Planar  docking  execution  schemes. 


372 


M.  Ciarcia  et  al.  /  Acta  Astronautica  102  (2014)  367-377 


in  the  expressions  of  0A(tf)  and  0B(tf)  is  due  to  the  fact 
that  the  docking  interface  is  located,  in  both  vehicles, 
on  the  opposite  side  from  where  the  attitude  is  mea¬ 
sured  (Fig.  1). 

(D2)  In  this  case  only  FSS  A  behaves  as  chaser  trying  to 
minimize  its  distance  from  FSS  B  which  instead  will 
keep  its  initial  position  while  aiming  its  docking  inter¬ 
face  toward  the  chaser.  In  other  words  only  one  of  the 
two  vehicles  is  in  charge  to  null  the  separation  distance 
whereas  the  other  will  just  wait  and  modifying  its 
attitude  to  match  the  final  chaser  attitude.  In  this 
scenario  the  final  conditions  become 

FSS  A 

xA(tf)  =xBf-d  cos  (<9B/  +n), 
yA(tf)=yBf-d  sin  (0Bf  +ri). 

Ok  (£/)  =  0Bf  +n, 

xA(tf)  =  0  ,yA(tf)  =  0,  bA(tf)  =  0.  (54) 

FSS  B 

*B(t/)  =  *b0,  yB(f/) =yBo>  oB(tf)  =  eAf +n, 

xB(tf)  =  0,yB(tf )  =  0, 0B(tf)  =  0.  (55) 

It  is  important  to  note  that  the  initial  position  compo¬ 
nents  xAo,yAo,xBo,yBoare  given  parameters  whereas  the  final 
position  components  and  attitude  xAf,yAf,0Af,xBf,yBf,0Bf  are 
unknown  parameters.  As  we  will  explain  better  in  the 
following  section,  the  closed  loop  implementation  of  the 
IDVD-SGRA  will  allow  us  to  make  an  estimate  of  these  last 
set  of  parameters  during  the  docking  execution.  Such  esti¬ 
mation  will  become  more  accurate  toward  the  final  part  of 
the  maneuver  and  becomes  a  precise  guess  by  the  end  of  it. 

4.1.  IDVD-SGRA  closed-loop  implementation 

In  a  previous  paper  [  1  ]  we  have  tested  the  capability  of 
the  IDVD-SGRA  method  to  fast  prototype  near-optimal 
trajectories.  In  detail,  for  the  IDVD-SGRA  with  3rd  order 
polynomial  description  of  position  coordinates  and  atti¬ 
tude,  it  can  converge  to  the  solution  with  CPUtime  of  the 
order  of  hundredths  of  a  second.  This  performance  makes 
it  suitable  for  closed-loop  implementation  in  a  decreasing 
horizon  fashion.  During  the  maneuver,  at  each  sample 
time  interval  (0.5  s),  the  current  position  components  and 
velocity  components  of  chaser  and  target  are  updated, 


then  a  new  near-optimal  trajectory  is  recomputed  and 
refreshed  values  of  force  components  and  torque  are 
commanded  (Fig.  3).  This  process  is  reiterated  until  com¬ 
pletion  of  the  maneuver.  The  remaining  maximum  man¬ 
euver  time,  which  at  the  beginning  of  the  maneuver  is 
equal  to  tMAx»  will  decrease  progressively  while  the  man¬ 
euver  time  is  elapsing. 

In  this  scenario  it  becomes  clear  that,  at  each  sample 
time  interval,  the  initial  conditions  to  be  updated  in 
Eq.  (48),  for  FSS  A,  are  the  following: 

*a0  =  *c(U),yA  o  =yc(ti\oAo=Oc(ti), 

*Ao  =  xc(ti),yAo  =  yc(td,  bAo  =  bc(ti).  (56) 

As  regard  to  the  final  conditions  to  be  updated  in 
Eqs.  (50)  and  (52),  from  the  current  status  of  the  target 
xT(ti),yT(ti),  0T(ti),  xT(ti),yT(ti),  <9r(tj),we  can  estimate  the 
final  docking  configuration  of  the  target  (FSS  B)  by  assum¬ 
ing  the  following  linear  motion  propagation 

*B/(h)  =  XT(ti)+I<xT(ti)(t max  -  U ), 

yB/(h)  =yT(ti)+i<yT(ti)(tMAx  -  to,  (57) 

where  I<  is  a  nonnegative  correction  constant  which  we 
have  introduced  to  take  into  account  the  changeable 
behavior  of  the  target.  During  our  experimental  campaign, 
we  have  found  heuristically  that  I<=  0.6  produces  satisfac¬ 
tory  results.  Note  that  for  U^tMAx  we  have  xB/(t;)->xr  (q) 
and  yBf  (h)->yr  (h),  this  confirm  that  at  the  end  of  the 
maneuver  the  target  estimated  position  coincides  with  its 
actual  position.  An  identical  approach  has  been  used  to 
update  FSS  B's  initial  and  final  conditions. 

As  for  the  desired  final  attitude  %,  it  has  been  defined 
as  the  angle  between  the  x-axis  and  the  line  which  goes 
from  the  center  of  mass  of  FSS  A  to  the  center  of  mass  of 
FSS  B,  i.e.  if 

Ax  =  xBf(t0-xAf(t0, 

a y=yB/(ti)-yA/(ti), 

a  =  arctan  (^)  (58) 

it  results 

Ax>0,Ay>0or  Ax>0,  Ay<0 
Ax  <  0,  Ay  >  0 

Ax  <  0,  Ay  <  0  (59) 

Ax  =  0,  Ay  >  0 

Ax  =  0,  Ay  <  0 


xr(t0’  yrOO’  ^tOO’ 


xcttiX  ycOi), 

xc0i),  ycOi),  ®c0i) 


Fig.  3.  IDVD-SGRA  closed-loop  guidance  scheme. 
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A  similar  definition  has  been  exploited  to  define  the 
desired  final  attitude  0Af. 


Fig.  4.  The  Floating  Spacecraft-Simulator  Testbed  at  Spacecraft  Robotics 
Laboratory. 
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Fig.  5.  Floating  Spacecraft  Simulator. 


5.  Floating  Spacecraft-Simulator  Testbed 

Our  experimental  campaign  has  been  conducted  at  the 
recently  upgraded  Floating  Spacecraft-Simulator  Testbed 
[5,6]  (Fig.  4).  This  unique  on-ground  experimental  facility 
is  composed  of  a  set  of  four  FSSs  and  a  high  precision 
floating  surface  located  in  a  clean/low  reflective  environ¬ 
ment.  The  dynamics  of  the  FSS  on  the  flat  surface  repro¬ 
duce  closely,  in  2D,  the  weightlessness  and  frictionless 
conditions  of  the  relative  3D  orbital  flight. 

The  main  floating  surface  is  a  granite  monolith  with  the 
following  characteristics: 

•  Dimensions:  4  m  x  4  m  x  0.3  m 

•  Surface  precision  grade:  AAA 

•  Planar  accuracy:  +  0.127  x  10“ 2  mm 

•  Horizontal  leveling  precision:  <  0.01  deg 

•  Mass:  15.2  x  103  kg. 

The  testbed  is  also  provided  with: 

1.  VICON  tracking  system  (10  cameras,  tracking  accuracy 
lower  than  1  mm  for  the  entire  volume  of  the 
laboratory) 

2.  Linux  Real-Time  work  station 

3.  Ad-Hoc  WiFi  internal  network  for  data  streaming 

4.  High  pressure  air  compressor  and  compressed  air  filling 
station. 


5.1.  Floating  Spacecraft  Simulator 

The  Floating  Spacecraft  Simulator  is  (Fig.  5)  a  custom 
designed  autonomous  vehicle  capable  of  floatation  via  air- 
pads  and  actuated  by  a  set  of  eight  supersonic  microthrusters 
[15]  (two  for  each  side)  fed  with  compressed  air.  Each  vehicle 
is  provided  with  an  on-board  computer  running  a  real-time 
operating  system,  a  fiber  optic  gyro  for  attitude  measure¬ 
ments,  and  two  compressed  air  tanks  which  feed  the  airpads 
and  microthrusters.  The  two  Floating  Spacecraft  Simulators 
are  characterized  by  the  following  specifications: 

mA  =  9.48  kg, 
mB  =  9.91  kg, 

1=6.02  x  10"2  kg  m2  , 

I  =  6.14  x  10"2  kg  m2  , 
ddoc  =  0.34  m  , 

Fmax  =  0.12  N, 

Tz  max  =  1.44  X  10“2  N  m. 


Table  1 

Test  cases. 


Case 

*a0  (m) 

Ta0  (™) 

*B0  (rri) 

yBo  (™) 

m A0  (kg) 

mBo  (kg) 

OviAX  (S) 

1 

0.29 

3.74 

3.68 

0.35 

9.46 

9.89 

97 

2 

1.40 

3.69 

2.74 

0.29 

9.42 

9.84 

80 

3 

0.41 

3.60 

3.77 

1.85 

9.42 

9.84 

84 

4 

1.44 

3.57 

3.38 

1.34 

9.43 

9.83 

79 
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6.  Test  cases 

Four  test  cases  have  been  considered  in  our  experi¬ 
ments.  In  Table  1  are  shown  the  initial  positions  and  the 
initial  masses  of  the  two  FSSs.  To  evaluate  the  IDVD-SGRA 
performances,  for  both  types  docking  execution  D1  and 
D2,  the  same  maneuvers  have  been  performed  using  an 
LQR  controller  [7-11].  Since  such  controller  does  not 
enforce  any  constraint  on  the  maximum  maneuver  time, 
in  order  to  ensure  that  this  parameter  is  equal  for  all  the 
controllers,  we  have  first  executed  the  dockings  with  the 
LQR  then  we  have  set  the  resulting  maneuver  times  as 
tMAx  for  the  IDVD-SGRA  (Table  1). 

7.  Results 

In  Table  2  are  shown  the  global  results  in  terms  of  total 
maneuver  energy  ETOt  and  propellant  mass  mPr0p  tot-  Those 
quantities  refer  to  the  maneuver  energy  and  propellant  mass, 
respectively,  required  by  both  vehicles.  For  comparison 
purposes,  we  will  consider  as  reference  the  total  energy 
and  propellant  mass  required  by  the  LQR  controller. 

These  results  show  that  the  total  maneuver  energy 
required  by  the  IDVD-SGRA  guidance  strategy  is  about 
one-sixth  of  the  energy  required  by  the  LQR.  In  terms  of 
fuel  consumption,  the  IDVD-SGRA  was  able  to  complete 


most  of  the  dockings  maneuvers  using  less  than  one-third  of 
propellant  mass  consumed  by  the  LQR.  From  Table  2  we 
observe  that  a  maneuver  characterized  by  a  higher  amount 
of  control  energy,  with  respect  to  another,  may  not 
correspond  to  an  increase  in  propellant  mass  (e.g.  Case 
2,  IDVD-SGRA  D1  and  IDVD-SGRA  D2).  This  is  due  to  the 
intrinsic  nature  of  the  Pulse  Width  Modulator  which  is 
used  to  convert  the  continuous  commanded  controls  into 
equivalent  sequence  of  fixed-thrust  impulses.  In  fact, 
very  low  values  of  commanded  thrust  or  torque,  which 
provide  positive  contribution  to  the  maneuver  energy, 
may  not  be  representable  as  a  sequence  of  impulses  due 
to  the  physical  limitation  of  the  actuators,  namely  the 
inertia  of  solenoid  valves.  This  implies  that  those  con¬ 
tribute  to  the  computed  control  energy  do  no  contribute 
to  the  fuel  mass  expenditure. 

In  Figs.  6-9  are  represented  the  bird's-eye  views  of  the 
docking  maneuvers  for  all  the  cases.  These  plots  show  that 
trajectories  performed  with  the  LQR  guidance  follow  a 
path  which  connects  smoothly  the  initial  positions.  The 
alignment  of  the  docking  interfaces  is  achieved  and  main¬ 
tained  for  a  long  portion  of  time  prior  to  the  end  of  the 
maneuver.  This  continuous  control  of  the  trajectory 
requires  an  action  on  the  thrusters  which  is  expensive  in 
terms  of  fuel  consumption.  Conversely,  the  IDVD-SGRA 
guidance  executes  the  maneuver  so  that  the  docking 


Table  2 

Global  results.  Maneuver  energy  and  fuel  consumption. 


Case 

Control  strategy 

Etot  ( N2s ) 

Ftot  (%) 

mpropTOT  (g) 

mProp  TOT  (%) 

1 

LQR 

2.44 

100 

75 

100 

IDVD-SGRA  D1 

0.43 

18 

31 

41 

IDVD-SGRA  D2 

0.17 

7 

17 

23 

2 

LQR 

2.29 

100 

71 

100 

IDVD-SGRA  D1 

0.33 

14 

19 

20 

IDVD-SGRA  D2 

0.29 

13 

22 

30 

3 

LQR 

2.51 

100 

76 

100 

IDVD-SGRA  D1 

0.25 

10 

14 

18 

IDVD-SGRA  D2 

0.28 

11 

8 

11 

4 

LQR 

2.34 

100 

70 

100 

IDVD-SGRA  D1 

0.23 

10 

18 

26 

IDVD-SGRA  D2 

0.34 

14 

10 

14 

LQR  IDVD-SGRA  D1  IDVD-SGRA  D2 


x  (m)  x  (m)  x  (m) 

Fig.  6.  Case  1.  Bird's-eye  views. 


M.  Garcia  et  al.  /  Acta  Astronautica  102  (2014)  367-377 


375 


LQR 


x(m) 


IDVD-SGRA  D1  IDVD-SGRA  D2 


Fig.  7.  A  Case  2.  Bird's-eye  views. 


LQR 


x  (m) 


4 

3.5 
3 

2.5 
2 

1.5 
1 

0.5 

0 


IDVD-SGRA  D1 


IDVD-SGRA  D2 


Fig.  8.  Case  3.  Bird's-eye  views. 
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Fig.  9.  Case  4.  Bird's-eye  views. 


IDVD-SGRA  D2 


configuration  is  achieved  only  at  the  end  of  it.  This  strategy 
implies  a  more  efficient  bang-off-bang  type  of  behavior  on 
the  control  time  history.  Limited  actions  on  the  controls 
are  commanded  only  when  consistent  trajectory  correc¬ 
tions  are  required.  These  differences  are  confirmed  by 
looking  at  the  thruster  usage  shown  in  Figs.  10-12.  Those 


refer  to  the  FSS  A’s  thrusters  impulses  time  histories 
obtained  for  all  the  three  docking  strategies  for  Case  4. 
During  the  last  thirty  seconds  of  the  maneuver  the  LQR 
guidance  expends  a  considerable  action  of  the  thrusters 
(see  Fig.  10,  whereas  IDVD-SGRA  D1  and  IDVD-SGRA  D2 
require  minimum  control. 
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Fig.  10.  Case  4.  LQR  docking.  FSS  A  thrusters  impulses  time  histories. 
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Fig.  11.  Case  4.  IDVD-SGRA  D1  docking.  FSS  A  thrusters  impulses  time 
histories. 


8.  Conclusion 

In  previous  works  we  have  successfully  applied  the 
IDVD-SGRA  to  rapid  prototype  near-optimal  trajectories 
and  as  near-optimal  guidance  for  spacecraft  proximity 
maneuvers.  The  IDVD  is  a  direct  method  capable  of 
converting  an  optimal  control  problem  into  a  nonlinear 
programming  problem,  whereas  the  SGRA  is  a  first-order 
solver  capable  of  robust  and  quick  convergence 


Fig.  12.  Case  4.  IDVD-SGRA  D2  docking.  FSS  A  thrusters  impulses  time 
histories. 


performances.  In  this  paper  we  describe  the  experimental 
validation  of  the  IDVD-SGRA  as  near-optimal  guidance 
strategy  for  cooperative  minimum  energy  planar  docking 
maneuvers.  A  set  of  docking  scenarios  has  been  designed 
and  experimented  at  the  Floating  Spacecraft-Simulator 
Testbed.  One  docking  scenario  requires  each  Floating 
Spacecraft  Simulator  to  behave  as  a  chaser  which  must 
dock  with  the  other  target  vehicle.  Subsequently,  we  have 
designed  a  docking  scenario  in  which  only  one  Floating 
Spacecraft  Simulator  behaves  as  a  chaser  reducing  the 
distance  from  the  target  which  instead  remains  in  its 
initial  position  while  aiming  its  docking  interface  toward 
the  chaser.  For  comparison  purposes,  the  same  set  of 
docking  maneuvers  has  been  performed  also  using 
a  different  controller  such  as  a  Linear  Quadratic  Regulator. 
The  presented  results  show  that  the  closed-loop  imple¬ 
mentation  of  the  IDVD-SGRA  is  capable  of  executing 
planar  docking  maneuvers  with  a  considerable  lower 
amount  of  energy  on  the  controls  and  propellant  mass. 
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